VU template DMP with FER [id: 7]

Organization VU Amsterdam
Created by Shuai Wang (shuai.wang.vu@gmail.com)
Based on VU template DMP with FER, 4.1.0 (shuaiwang2:vu-template-dmp-with-fer-v2:4.1.0)
Project Phase never
Created at 31 Oct 2023

Report generated by Data Stewardship Wizard <https://ds-wizard.org>

Summary Report

Summary

Answered (current phase) 28 / 28
Answered 28 / 28

I. Introduction

Thank you for participating in our study to explore the use of FAIR-Enabling Resources (FERs) as responses to Data Management Plan (DMP) questions. It will take 30-35 minutes to complete this DMP.

In this task, we will provide you with a DMP containing pre-populated answers to assist you in the task of finding the FER. The goal of this exercise is twofold:

  1. You have the option to use the suggestions provided in the DMP. These suggestions are decisions made by research communities and can serve as valuable starting points for your responses.
  2. Additionally, you should use the search bar to specify the FER. This demonstrates the efficiency and automation made possible by this machine-actionable feature. By using the search box, you can precisely locate the FER and make the result DMP more machine-processable.

This approach showcases the concept of machine-actionability, which is in contrast to the classical manual approach. Instead of solely relying on manual data stewardship, where all tasks are handled manually, you have the option to harness machine-assisted processes. You can either select responses from the provided suggestions or formulate your own answers. The key is to use the search bar effectively to pinpoint the FER, illustrating the enhanced efficiency and automation capabilities made possible by this feature.

Please make sure to keep track (or write it down on a piece of paper) of which questions you relied on the suggestions from FIPs when providing your answers. This information will be requested in the survey.

FAIR Data and Community Standards

Our main goal in this study is to see if using FERs as suggestions can make it easier for you to fill out DMPs. Additionally, we want to ensure that the data you generate follows the FAIR principles.

What is FAIR Data?

FAIR is an acronym that stands for Findable, Accessible, Interoperable, and Reusable.

  • Findable: FAIR data should be easy to locate and access by both humans and computers. This entails assigning unique identifiers (such as DOIs or URLs) and clear metadata to data sets.
  • Accessible: Once data is found, it should be readily accessible. This means that data should be stored in repositories that provide access to authorized users and ensure long-term preservation.
  • Interoperable: FAIR data should be structured in a way that allows it to work smoothly with other data. Standards and common data formats play a crucial role in achieving interoperability.
  • Reusable: To maximize the usefulness of data, it should be well-documented, explaining how others can use it. This involves providing comprehensive metadata and clear licensing terms for data reuse.
Community Standards and FAIRness

Community standards are essential in achieving FAIRness in data management. These standards are created collaboratively within specific research communities to ensure that data is collected, stored, and shared in a consistent and standardized way. When researchers follow these community standards, they help promote FAIR data practices within their field. In this study, we offer FERs from six social science communities to support this effort.

Study Overview

In this study, you will be presented with two distinct sections that encompass various aspects of data management and sharing within the social science domain. These sections will help us understand what you know, your views, and your choices when it comes to archiving and publishing data, documenting it, and managing resources for data.

Section 4: Data Archiving and Publishing (4 Questions)

In this section, you will find four questions related to data archiving and publishing. We will ask you about where and how you plan to publish your data. We are also interested in your choice of identifiers and licensing for your data assets.

Section 5: Documentation (3 Questions)

Section five focuses on the documentation needed for your project and data assets. We will inquire about metadata, methods, and tools necessary for accessing and using your data.

Feedback Survey

At the conclusion of your participation in this study, we kindly request you to provide us with your valuable feedback by filling out a short survey. Your feedback is instrumental in helping us understand your experience and gather insights that will assist us in refining our research methods and tools.

FAIR-Enabling Resources (FERs) Suggestions

To assist you in answering the questions in each section, we will provide suggestions for FAIR-Enabling Resources (FERs). These resources have been sourced from various FAIR Implementation Communities (FICs) in social science, which include:

  1. SSHOC-NL Socio Economic History (SEH)
  2. Media Content Analysis Lab (MCAL)
  3. LGBTQ+ Linked Open Vocabulary (LGBTQVoC)
  4. Social Science Survey Research (SSSR)
  5. European Social Survey (ESS)
  6. Australian Social Survey International-ESS (AUSSI-ESS)

These research communities were chosen because they were the only available ones and may be relevant to the researcher's DMP. These FERs are meant to serve as valuable references as you answer the questions in each section. You should both type your responses in the provided text fields and select relevant FERs from a dropdown menu for each question.

In case of trouble or questions, you can contact us on n.k2.singh@student.vu.nl.

Summary

Answered (current phase) 0 / 0
Answered 0 / 0

Questions

No questions

II. 4. Data archiving and publishing

Please make sure to keep track (or write it down on a piece of paper) of which questions you relied on the suggestions from FIPs when providing your answers. This information will be requested in the survey.

Summary

Answered (current phase) 16 / 16
Answered 16 / 16

Questions

1 4.6 Where will you publish your data assets?

Description

Data assets can only be shared in external repositories with which the VU has a processing agreement. Discuss with your Faculty’s Privacy Champion whether or not you can share data in an external repository. Do this as early as possible in your research; don’t wait until you are ready to share your data.

Suggestion from the FIPs you selected

You will first be given an option to describe your answer in text, and afterwards you can search using the search bar in case the FER is present there.

The following are the options suggested by the FIPs you selected:

| FERs | FIC | | ----- | ---- | | The Dataverse Project | SEH | | DANS Data Station SSH, The Dataverse Project, Figshare, Open Science Framework | MCAL | | GESIS Search | SSSR | | ESS Data Portal, EOSC Portal | ESS | | Australian Data Archive Dataverse FER | AUSSI-ESS |

Answers

1.a.1 In text, describe your answer

Data assest produced by the project will be shared via the Dataverse Project in compliance with rules and common practices.

1.a.2 Additionally, please specify the FER using by searching (ignore it if not found).

Answers

1.a.2.a.1

The Dataverse Project

The Dataverse is an open source web application to share, preserve, cite, explore and analyze research data. Researchers, data authors, publishers, data distributors, and affiliated institutions all receive appropriate credit via a data citation with a persistent identifier (e.g., DOI, or handle). A Dataverse repository hosts multiple dataverses. Each dataverse contains dataset(s) or other dataverses, and each dataset contains descriptive metadata and data files (including documentation and code that accompany the data). Dataverse is also installed in the countries of the European Union to preserve data collected by research communities of Netherlands, Germany, France and Finland. The largest Dataverse repository is called DataverseNL and located in the Netherlands providing data management services for 11 Dutch Universities.

2 4.8 How will you ensure your data assets get a persistent identifier (e.g. a DOI-code)?

Description

A persistent identifier (PID) is a durable reference to a digital dataset document, website or other object. You get a persistent identifier when you publish your data asset in a repository (e.g. DataverseNL/ Open Science Framework/ Zenodo). Note that if you store your data in ArchStor or DarkStor, you will receive a unique code that serves as a proxy for an identifier.

Suggestion from the FIPs you selected

You will first be given an option to describe your answer in text, and afterwards you can search using the search bar in case the FER is present there.

The following are the options suggested by the FIPs you selected:

| FERs | FIC | | ----- | ---- | | DOI, Handle System | SEH | | DOI | MCAL | | DOI | LGBTQVoC | | DOI, DDI URN | ESS | | DataCite DOI Resolution Service | AUSSI-ESS |

Answers

2.a.1 In text, describe your answer

Ensuring that the data assets have a persistent identifier will be accomplished by sharing the data via The Dataverse project which provides a persistent DOI to published data.

2.a.2 Additionally, please specify the FER using by searching (ignore it if not found).

Answers

2.a.2.a.1

DOI | Digital Object Identifier

The digital object identifier (DOI) system originated in a joint initiative of three trade associations in the publishing industry (International Publishers Association; International Association of Scientific, Technical and Medical Publishers; Association of American Publishers). The system was announced at the Frankfurt Book Fair 1997. The International DOI Foundation (IDF) was created to develop and manage the DOI system, also in 1997. The DOI system was adopted as International Standard ISO 26324 in 2012. The DOI system implements the Handle System and adds a number of new features. The DOI system provides an infrastructure for persistent unique identification of objects of any type. The DOI system is designed to work over the Internet. A DOI name is permanently assigned to an object to provide a resolvable persistent network link to current information about that object, including where the object, or information about it, can be found on the Internet. While information about an object can change over time, its DOI name will not change. A DOI name can be resolved within the DOI system to values of one or more types of data relating to the object identified by that DOI name, such as a URL, an e-mail address, other identifiers and descriptive metadata. The DOI system enables the construction of automated services and transactions. Applications of the DOI system include but are not limited to managing information and documentation location and access; managing metadata; facilitating electronic transactions; persistent unique identification of any form of any data; and commercial and non-commercial transactions. The content of an object associated with a DOI name is described unambiguously by DOI metadata, based on a structured extensible data model that enables the object to be associated with metadata of any desired degree of precision and granularity to support description and services. The data model supports interoperability between DOI applications. The scope of the DOI system is not defined by reference to the type of content (format, etc.) of the referent, but by reference to the functionalities it provides and the context of use. The DOI system provides, within networks of DOI applications, for unique identification, persistence, resolution, metadata and semantic interoperability.

3 4.9 Will you register your datasets in an online registry other than PURE? If yes, where?

Description

Registering your data assets in an online registry increases the findability of your work. According to the VU RDM policy, you should register your data assets in the PURE Research Portal, even if you cannot share the actual datasets themselves publicly due to legal, intellectual property, privacy- or security-related issues.

Suggestion from the FIPs you selected

You will first be given an option to describe your answer in text, and afterwards you can search using the search bar in case the FER is present there.

The following are the options suggested by the FIPs you selected:

| FERs | FIC | | ----- | ---- | | DOI, Handle System | SEH | | DOI | MCAL | | DOI | LGBTQVoC | | DOI, DDI URN | ESS | | DataCite DOI Resolution Service | AUSSI-ESS |

Answers

3.a.1 In text, describe your answer

Once the dataset is provided with a persistent identifier - DOI - via The Dataverse Project, I will make sure that it is recorded, with all relevant metadata on Crossref.

3.a.2 Additionally, please specify the FER using by searching (ignore it if not found).

Answers

3.a.2.a.1

Crossref (DOI)

*Crossref is a registration agency of the International DOI Foundation. Crossref provides a mechanism for identifying and describing research objects (books and chapters, components, conference proceedings, datasets, dissertations, grants, journals and articles, peer reviews, pending publications, posted content (includes preprints), reports and working papers, and standards). It follows the ISO/IEC 11179 Metadata Registry (MDR) standard, which specifies a schema for recording both the meaning and technical structure of the data for unambiguous usage by humans and computers. CrossRef uses a single deposit schema stored as XML, which supports a range of different content types and provides a structure and set of rules to keep everything consistent and interoperable. *

4 4.13 Please indicate the license and/ or terms of use under which you share your data.

Description

By specifying a license and defining the terms of use, you can manage the purposes for which your data assets can be used by others. Most repositories allow you to specify a license and terms of use for your shared data assets. For example, the standard license in the DataverseNL repository is Creative Commons Zero Waiver (others can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission), but it provides other license options.

Suggestion from the FIPs you selected

You will first be given an option to describe your answer in text, and afterwards you can search using the search bar in case the FER is present there.

The following are the options suggested by the FIPs you selected:

| FERs | FIC | | ----- | ---- | | CC BY SA 4.0, CC BY-NC 4.0 | SEH | | CC BY-NC-ND 4.0 | LGBTQVoC | | GESIS Usage Regulations 2018 | SSSR | | CC BY-NC-SA 4.0 | ESS |

Answers

4.a.1 In text, describe your answer

We plan to use the VU's default licensing option, which is CC-BY 4.0, for our datasets.

4.a.2 Additionally, please specify the FER using by searching (ignore it if not found).

III. 5. Documentation

Please make sure to keep track (or write it down on a piece of paper) of which questions you relied on the suggestions from FIPs when providing your answers. This information will be requested in the survey.

Summary

Answered (current phase) 12 / 12
Answered 12 / 12

Questions

1 5.1 What metadata and documentation will accompany the project?

Description

Documentation of your research is important for the reusability of your data assets. Consider what kinds of documentation you will produce during your research project. Examples may include codebooks or lab journals, read-me files, research logs, protocols.

Metadata are information about your data. Using a standard way of documenting your data makes it easier for other researchers to work with and reuse your data assets. Metadata standards exist for different research fields.

Suggestion from the FIPs you selected

You will first be given an option to describe your answer in text, and afterwards you can search using the search bar in case the FER is present there.

The following are the options suggested by the FIPs you selected:

| FERs | FIC | | ----- | ---- | | MARC21, EAD3, DDI-Codebook, DCAT2 | SEH | | MARC21, BIBFRAME | LGBTQVoC | | DDI-Codebook | SSSR | | DDI-Lifecycle 3.3, DDI-Codebook | ESS | | DDI-Codebook, DataCite Metadata Schema Version 3.1 | AUSSI-ESS |

Answers

1.a.1 In text, describe your answer

Relying on OSF standards of metadata documentation, the relevant information will be collected throughout the project and published in accordance with the project time-line. These include read-me.files, and process documentation

1.a.2 Additionally, please specify the FER using by searching (ignore it if not found).

Answers

1.a.2.a.1

This question has not been answered yet!

2 5.2 What metadata and documentation will accompany the data assets?

Description

Consider how you will document your research process (e.g. in cleaning/processing scripts, algorithms and code comments). Metadata standards may include data and variable naming conventions, file and folder structure, instruction files and versioning protocols for your data assets and documentation.

Suggestion from the FIPs you selected

You will first be given an option to describe your answer in text, and afterwards you can search using the search bar in case the FER is present there.

The following are the options suggested by the FIPs you selected:

| FERs | FIC | | ----- | ---- | | MARC21, EAD3, DDI-Codebook, DCAT2 | SEH | | MARC21, BIBFRAME | LGBTQVoC | | DDI-Codebook | SSSR | | DDI-Lifecycle 3.3 | ESS | | DDI-Codebook, DataCite Metadata Schema Version 3.1 | AUSSI-ESS |

Answers

2.a.1 In text, describe your answer

All relevant codebooks and code used in the construction of the datasets will be made available together with the published dataset. These will include the survey questions and range of answers, documentation of the sampling procedures, and handling of data between getting the survery results and processing them into a dataset.

2.a.2 Additionally, please specify the FER using by searching (ignore it if not found).

Answers

2.a.2.a.1

This question has not been answered yet!

3 5.3 What methods, software or hardware are needed to access and use your data?

Description

It is recommended that data assets be saved as much as is possible in non-proprietary (/non-commercial) formats. This makes it easier to re-use the data using all kinds of software depending on a researchers' needs. This is also in line with the general societal & funders increased focus towards Open Science. The national archive DANS has an overview of preferred formats. When necessary, indicate whether potential users need certain types of hardware/ computing power/ proprietary software/ etc to access and (re-)use the data. Also consider the sustainability of software needed for accessing the data (e.g. containerization of analysis code).

Suggestion from the FIPs you selected

You will first be given an option to describe your answer in text, and afterwards you can search using the search bar in case the FER is present there.

The following are the options suggested by the FIPs you selected:

| FERs | FIC | | ----- | ---- | | REST API, HTTPS, OAI-PMH, SWORD, CLARIN AAI | SEH | | HTTPS, REST API | MCAL | | REST API, HTTPS | LGBTQVoC | | HTTPS, GraphQL API, eduGAIN Interfederation Service, OpenID Connect, Azure Active Directory | ESS | | HTTPS, REST, SAML2 | AUSSI-ESS |

Answers

3.a.1 In text, describe your answer

The dataset will be saved in a .csv format that can be read by multiple statistical softwares commonly used in the analysis of survey data.

3.a.2 Additionally, please specify the FER using by searching (ignore it if not found).

Answers

3.a.2.a.1

**CSV File Format **

Files with .csv (Comma Separated Values) extension represent plain text files that contain records of data with comma separated values. Each line in a CSV file is a new record from the set of records contained in the file. Such files are generated when data transfer is intended from one storage system to another. Since all applications can recognize records separated by comma, import of such data files to database is done very conveniently. Almost all spreadsheet applications such as Microsoft Excel or OpenOffice Calc can import CSV without much effort. Data imported from such files is arranged in cells of a spreadsheet for representation to user.

IV. Feedback Survey

We kindly request your participation in a brief survey designed to gather your feedback on the use of FAIR-Enabling Resource (FER) suggestions in Data Management Plans (DMPs). Your valuable input will help us understand the effectiveness of this approach and make improvements if necessary.

Time Estimate: This survey should take approximately 15 minutes to complete.

Number of Questions: There are a total of 12 questions in this survey.

Survey Link: https://vuamsterdam.eu.qualtrics.com/jfe/form/SV_4NtIaaPX6avzR3M

Your participation and insights are highly appreciated. Thank you for contributing to our research efforts.

Summary

Answered (current phase) 0 / 0
Answered 0 / 0

Questions

No questions